SEQUENCE LISTING 



(1) GENERAL INFORMATION:

(i) APPLICANT: Antelman, Douglas 

Gregory, Richard J. 
Wils, Kenneth N. 

(ii) TITLE OF INVENTION: Tissue Specific Expression of 
Retinoblastoma Protein 

(iii) NUMBER OF SEQUENCES: 46 

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: TOWNSEND and TOWNSEND and CREW LLP

(B) STREET: Two Embarcadero Center, 8th Floor

(C) CITY: San Francisco 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 94111 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/801,092 

(B) FILING DATE: 14-FEB-1997 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/751,517 

(B) FILING DATE: 15-NOV-1996 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Fitts, Renee A. 

(B) REGISTRATION NUMBER: 35,136 

(C) REFERENCE/DOCKET NUMBER: 016930-001020

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: 415-576-0200
(B) TELEFAX: 703-576-0300



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 437 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: not relevant

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:



Met Ala Leu Ala Gly Ala Pro Ala Gly Gly Pro Cys Ala Pro Ala Leu
1 5 10 15

Glu Ala Leu Leu Gly Ala Gly Ala Leu Arg Leu Leu Asp Ser Ser Gln
20 25 30

Ile Val Ile Ile Ser Ala Ala Gln Asp Ala Ser Ala Pro Pro Ala Pro
35 40 45

Thr Gly Pro Ala Ala Pro Ala Ala Gly Pro Cys Asp Pro Asp Leu Leu
50 55 60

Leu Phe Ala Thr Pro Gln Ala Pro Arg Pro Thr Pro Ser Ala Pro Arg
65 70 75 80

Pro Ala Leu Gly Arg Pro Pro Val Lys Arg Arg Leu Asp Leu Glu Thr
85 90 95

Asp His Gln Tyr Leu Ala Glu Ser Ser Gly Pro Ala Arg Gly Arg Gly
100 105 110

Arg His Pro Gly Lys Gly Val Lys Ser Pro Gly Glu Lys Ser Arg Tyr
115 120 125

Glu Thr Ser Leu Asn Leu Thr Thr Lys Arg Phe Leu Glu Leu Leu Ser
130 135 140

His Ser Ala Asp Gly Val Val Asp Leu Asn Trp Ala Ala Glu Val Leu
145 150 155 160

Lys Val Gln Lys Arg Arg Ile Tyr Asp Ile Thr Asn Val Leu Glu Gly
165 170 175

Ile Gln Leu Ile Ala Lys Lys Ser Lys Asn His Ile Gln Trp Leu Gly
180 185 190

Ser His Thr Thr Val Gly Val Gly Gly Arg Leu Glu Gly Leu Thr Gln
195 200 205

Asp Leu Arg Gln Leu Gln Glu Ser Glu Gln Gln Leu Asp His Leu Met
210 215 220

Asn Ile Cys Thr Thr Gln Leu Arg Leu Leu Ser Glu Asp Thr Asp Ser
225 230 235 240

Gln Arg Leu Ala Tyr Val Thr Cys Gln Asp Leu Arg Ser Ile Ala Asp
245 250 255

Pro Ala Glu Gln Met Val Met Val Ile Lys Ala Pro Pro Glu Thr Gln
260 265 270

Leu Gln Ala Val Asp Ser Ser Glu Asn Phe Gln Ile Ser Leu Lys Ser
275 280 285

Lys Gln Gly Pro Ile Asp Val Phe Leu Cys Pro Glu Glu Thr Val Gly
290 295 300

Gly Ile Ser Pro Gly Lys Thr Pro Ser Gln Glu Val Thr Ser Glu Glu
305 310 315 320

Glu Asn Arg Ala Thr Asp Ser Ala Thr Ile Val Ser Pro Pro Pro Ser
325 330 335

Ser Pro Pro Ser Ser Leu Thr Thr Asp Pro Ser Gln Ser Leu Leu Ser
340 345 350

Leu Glu Gln Glu Pro Leu Leu Ser Arg Met Gly Ser Leu Arg Ala Pro
355 360 365

Val Asp Glu Asp Arg Leu Ser Pro Leu Val Ala Ala Asp Ser Leu Leu
370 375 380

Glu His Val Arg Glu Asp Phe Ser Gly Leu Leu Pro Glu Glu Phe Ile
385 390 395 400

Ser Leu Ser Pro Pro His Glu Ala Leu Asp Tyr His Phe Gly Leu Glu
405 410 415

Glu Gly Glu Gly Ile Arg Asp Leu Phe Asp Cys Asp Phe Gly Asp Leu
420 425 430

Thr Pro Leu Asp Phe
435

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2517 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

GGAATTCCGT GGCCGGGACT TTGCAGGCAG CGGCGGCCGG GGGCGGAGCG GGATCGAGCC 60

CTCGCCGAGG CCTGCCGCCA TGGGCCCGCG CCGCCGCCGC CGCCTGTCAC CCGGGCCGCG 120

CGGGCCGTGA GCGTCATGGC CTTGGCCGGG GCCCCTGCGG GCGGCCCATG CGCGCCGGCG 180

CTGGAGGCCC TGCTCGGGGC CGGCGCGCTG CGGCTGCTCG ACTCCTCGCA GATCGTCATC 240

ATCTCCGCCG CGCAGGACGC CAGCGCCCCG CCGGCTCCCA CCGGCCCCGC GGCGCCCGCC 300

GCCGGCCCCT GCGACCCTGA CCTGCTGCTC TTCGCCACAC CGCAGGCGCC CCGGCCCACA 360

CCCAGTGCGC CGCGGCCCGC GCTCGGCCGC CCGCCGGTGA AGCGGAGGCT GGACCTGGAA 420

ACTGACCATC AGTACCTGGC CGAGAGCAGT GGGCCAGCTC GGGGCAGAGG CCGCCATCCA 480

GGAAAAGGTG TGAAATCCCC GGGGGAGAAG TCACGCTATG AGACCTCACT GAATCTGACC 540

ACCAAGCGCT TCCTGGAGCT GCTGAGCCAC TCGGCTGACG GTGTCGTCGA CCTGAACTGG 600

GCTGCCGAGG TGCTGAAGGT GCAGAAGCGG CGCATCTATG ACATCACCAA CGTCCTTGAG 660

GGCATCCAGC TCATTGCCAA GAAGTCCAAG AACCACATCC AGTGGCTGGG CAGCCACACC 720

ACAGTGGGCG TCGGCGGACG GCTTGAGGGG TTGACCCAGG ACCTCCGACA GCTGCAGGAG 780

AGCGAGCAGC AGCTGGACCA CCTGATGAAT ATCTGTACTA CGCAGCTGCG CCTGCTCTCC 840

GAGGACACTG ACAGCCAGCG CCTGGCCTAC GTGACGTGTC AGGACCTTCG TAGCATTGCA 900

GACCCTGCAG AGCAGATGGT TATGGTGATC AAAGCCCCTC CTGAGACCCA GCTCCAAGCC 960

GTGGACTCTT CGGAGAACTT TCAGATCTCC CTTAAGAGCA AACAAGGCCC GATCGATGTT 1020

TTCCTGTGCC CTGAGGAGAC CGTAGGTGGG ATCAGCCCTG GGAAGACCCC ATCCCAGGAG 1080

GTCACTTCTG AGGAGGAGAA CAGGGCCACT GACTCTGCCA CCATAGTGTC ACCACCACCA 1140

TCATCTCCCC CCTCATCCCT CACCACAGAT CCCAGCCAGT CTCTACTCAG CCTGGAGCAA 1200

GAACCGCTGT TGTCCCGGAT GGGCAGCCTG CGGGCTCCCG TGGACGAGGA CCGCCTGTCC 1260

CCGCTGGTGG CGGCCGACTC GCTCCTGGAG CATGTGCGGG AGGACTTCTC CGGCCTCCTC 1320

CCTGAGGAGT TCATCAGCCT TTCCCCACCC CACGAGGCCC TCGACTACCA CTTCGGCCTC 1380

GAGGAGGGCG AGGGCATCAG AGACCTCTTC GACTGTGACT TTGGGGACCT CACCCCCCTG 1440

GATTTCTGAC AGGGCTTGGA GGGACCAGGG TTTCCAGAGT AGCTCACCTT GTCTCTGCAG 1500

CCCTGGAGCC CCCTGTCCCT GGCCGTCCTC CCAGCCTGTT TGGAAACATT TAATTTATAC 1560

CCCTCTCCTC TGTCTCCAGA AGCTTCTAGC TCTGGGGTCT GGCTACCGCT AGGAGGCTGA 1620

GCAAGCCAGG AAGGGAAGGA GTCTGTGTGG TGTGTATGTG CATGCAGCCT ACACCCACAC 1680

GTGTGTACCG GGGGTGAATG TGTGTGAGCA TGTGTGTGTG CATGTACCGG GGAATGAAGG 1740

TGAACATACA CCTCTGTGTG TGCACTGCAG ACACGCCCCA GTGTGTCCAC ATGTGTGTGC 1800

ATGAGTCCAT CTCTGCGCGT GGGGGGGCTC TAACTGCACT TTCGGCCCTT TTGCTCGTGG 1860

GGTCCCACAA GGCCCAGGGC AGTGCCTGCT CCCAGAATCT GGTGCTCTGA CCAGGCCAGG 1920

TGGGGAGGCT TTGGCTGGCT GGGCGTGTAG GACGGTGAGA GCACTTCTGT CTTAAAGGTT 1980

TTTTCTGATT GAAGCTTTAA TGGAGCGTTA TTTATTTATC GAGGCCTCTT TGGTGAGCCT 2040

GGGGAATCAG CAAAAGGGGA GGAGGGGTGT GGGGTTGATA CCCCAACTCC CTCTACCCTT 2100

GAGCAAGGGC AGGGGTCCCT GAGCTGTTCT TCTGCCCCAT ACTGAAGGAA CTGAGGCCTG 2160

GGTGATTTAT TTATTGGGAA AGTGAGGGAG GGAGACAGAC TGACTGACAG CCATGGGTGG 2220

TCAGATGGTG GGGTGGGCCC TCTCCAGGGG GCCAGTTCAG GGCCCAGCTG CCCCCCAGGA 2280

TGGATATGAG ATGGGAGAGG TGAGTGGGGG ACCTTCACTG ATGTGGGCAG GAGGGGTGGT 2340

GAAGGCCTCC CCCAGCCCAG ACCCTGTGGT CCCTCCTGCA GTGTCTGAAG CGCCTGCCTC 2400

CCCACTGCTC TGCCCCACCC TCCAATCTGC ACTTTGATTT GCTTCCTAAC AGCTCTGTTC 2460

CCTCCTGCTT TGGTTTTAAT AAATATTTTG ATGACGTTAA AAAAAGGAAT TCGATAT 2517
(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2994 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:





TTCCGGTTTT TCTCAGGGGA CGTTGAAATT ATTTTTGTAA CGGGAGTCGG GAGAGGACGG 60

GGCGTGCCCC GCGTGCGCGC GCGTCGTCCT CCCCGGCGCT CCTCCACAGC TCGCTGGCTC 120

CCGCCGCGGA AAGGCGTCAT GCCGCCCAAA ACCCCCCGAA AAACGGCCGC CACCGCCGCC 180

GCTGCCGCCG CGGAACCCCC GGCACCGCCG CCGCCGCCCC CTCCTGAGGA GGACCCAGAG 240

CAGGACAGCG GCCCGGAGGA CCTGCCTCTC GTCAGGCTTG AGTTTGAAGA AACAGAAGAA 300

CCTGATTTTA CTGCATTATG TCAGAAATTA AAGATACCAG ATCATGTCAG AGAGAGAGCT 360

TGGTTAACTT GGGAGAAAGT TTCATCTGTG GATGGAGTAT TGGGAGGTTA TATTCAAAAG 420

AAAAAGGAAC TGTGGGGAAT CTGTATCTTT ATTGCAGCAG TTGACCTAGA TGAGATGTCG 480

TTCACTTTTA CTGAGCTACA GAAAAACATA GAAATCAGTG TCCATAAATT CTTTAACTTA 540

CTAAAAGAAA TTGATACCAG TACCAAAGTT GATAATGCTA TGTCAAGACT GTTGAAGAAG 600

TATGATGTAT TGTTTGCACT CTTCAGCAAA TTGGAAAGGA CATGTGAACT TATATATTTG 660

ACACAACCCA GCAGTTCGAT ATCTACTGAA ATAAATTCTG CATTGGTGCT AAAAGTTTCT 720

TGGATCACAT TTTTATTAGC TAAAGGGGAA GTATTACAAA TGGAAGATGA TCTGGTGATT 780

TCATTTCAGT TAATGCTATG TGTCCTTGAC TATTTTATTA AACTCTCACC TCCCATGTTG 840

CTCAAAGAAC CATATAAAAC AGCTGTTATA CCCATTAATG GTTCACCTCG AACACCCAGG 900

CGAGGTCAGA ACAGGAGTGC ACGGATAGCA AAACAACTAG AAAATGATAC AAGAATTATT 960

GAAGTTCTCT GTAAAGAACA TGAATGTAAT ATAGATGAGG TGAAAAATGT TTATTTCAAA 1020

AATTTTATAC CTTTTATGAA TTCTCTTGGA CTTGTAACAT CTAATGGACT TCCAGAGGTT 1080

GAAAATCTTT CTAAACGATA CGAAGAAATT TATCTTAAAA ATAAAGATCT AGATGCAAGA 1140

TTATTTTTGG ATCATGATAA AACTCTTCAG ACTGATTCTA TAGACAGTTT TGAAACACAG 1200

AGAACACCAC GAAAAAGTAA CCTTGATGAA GAGGTGAATG TAATTCCTCC ACACACTCCA 1260

GTTAGGACTG TTATGAACAC TATCCAACAA TTAATGATGA TTTTAAATTC AGCAAGTGAT 1320

CAACCTTCAG AAAATCTGAT TTCCTATTTT AACAACTGCA CAGTGAATCC AAAAGAAAGT 1380

ATACTGAAAA GAGTGAAGGA TATAGGATAC ATCTTTAAAG AGAAATTTGC TAAAGCTGTG 1440

GGACAGGGTT GTGTCGAAAT TGGATCACAG CGATACAAAC TTGGAGTTCG CTTGTATTAC 1500

CGAGTAATGG AATCCATGCT TAAATCAGAA GAAGAACGAT TATCCATTCA AAATTTTAGC 1560

AAACTTCTGA ATGACAACAT TTTTCATATG TCTTTATTGG CGTGCGCTCT TGAGGTTGTA 1620

ATGGCCACAT ATAGCAGAAG TACATCTCAG AATCTTGATT CTGGAACAGA TTTGTCTTTC 1680

CCATGGATTC TGAATGTGCT TAATTTAAAA GCCTTTGATT TTTACAAAGT GATCGAAAGT 1740

TTTATCAAAG CAGAAGGCAA CTTGACAAGA GAAATGATAA AACATTTAGA ACGATGTGAA 1800

CATCGAATCA TGGAATCCCT TGCATGGCTC TCAGATTCAC CTTTATTTGA TCTTATTAAA 1860

CAATCAAAGG ACCGAGAAGG ACCAACTGAT CACCTTGAAT CTGCTTGTCC TCTTAATCTT 1920

CCTCTCCAGA ATAATCACAC TGCAGCAGAT ATGTATCTTT CTCCTGTAAG ATCTCCAAAG 1980

AAAAAAGGTT CAACTACGCG TGTAAATTCT ACTGCAAATG CAGAGACACA AGCAACCTCA 2040

GCCTTCCAGA CCCAGAAGCC ATTGAAATCT ACCTCTCTTT CACTGTTTTA TAAAAAAGTG 2100

TATCGGCTAG CCTATCTCCG GCTAAATACA CTTTGTGAAC GCCTTCTGTC TGAGCACCCA 2160

GAATTAGAAC ATATCATCTG GACCCTTTTC CAGCACACCC TGCAGAATGA GTATGAACTC 2220

ATGAGAGACA GGCATTTGGA CCAAATTATG ATGTGTTCCA TGTATGGCAT ATGCAAAGTG 2280

AAGAATATAG ACCTTAAATT CAAAATCATT GTAACAGCAT ACAAGGATCT TCCTCATGCT 2340

GTTCAGGAGA CATTCAAACG TGTTTTGATC AAAGAAGAGG AGTATGATTC TATTATAGTA 2400

TTCTATAACT CGGTCTTCAT GCAGAGACTG AAAACAAATA TTTTGCAGTA TGCTTCCACC 2460

AGGCCCCCTA CCTTGTCACC AATACCTCAC ATTCCTCGAA GCCCTTACAA GTTTCCTAGT 2520

TCACCCTTAC GGATTCCTGG AGGGAACATC TATATTTCAC CCCTGAAGAG TCCATATAAA 2580

ATTTCAGAAG GTCTGCCAAC ACCAACAAAA ATGACTCCAA GATCAAGAAT CTTAGTATCA 2640

ATTGGTGAAT CATTCGGGAC TTCTGAGAAG TTCCAGAAAA TAAATCAGAT GGTATGTAAC 2700

AGCGACCGTG TGCTCAAAAG AAGTGCTGAA GGAAGCAACC CTCCTAAACC ACTGAAAAAA 2760

CTACGCTTTG ATATTGAAGG ATCAGATGAA GCAGATGGAA GTAAACATCT CCCAGGAGAG 2820

TCCAAATTTC AGCAGAAACT GGCAGAAATG ACTTCTACTC GAACACGAAT GCAAAAGCAG 2880

AAAATGAATG ATAGCATGGA TACCTCAAAC AAGGAAGAGA AATGAGGATC TCAGGACCTT 2940

GGTGGACACT GTGTACACCT CTGGATTCAT TGTCTCTCAC AGATGTGACT GTAT 2994
(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 928 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: not relevant

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala Ala Ala
1 5 10 15

Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu Glu Asp
20 25 30

Pro Glu Gln Asp Ser Gly Pro Glu Asp Leu Pro Leu Val Arg Leu Glu
35 40 45

Phe Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys Gln Lys Leu
50 55 60

Lys Ile Pro Asp His Val Arg Glu Arg Ala Trp Leu Thr Trp Glu Lys
65 70 75 80

Val Ser Ser Val Asp Gly Val Leu Gly Gly Tyr Ile Gln Lys Lys Lys
85 90 95

Glu Leu Trp Gly Ile Cys Ile Phe Ile Ala Ala Val Asp Leu Asp Glu
100 105 110

Met Ser Phe Thr Phe Thr Glu Leu Gln Lys Asn Ile Glu Ile Ser Val
115 120 125

His Lys Phe Phe Asn Leu Leu Lys Glu Ile Asp Thr Ser Thr Lys Val
130 135 140

Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu Phe Ala
145 150 155 160

Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu Ile Tyr Leu Thr Gln
165 170 175

Pro Ser Ser Ser Ile Ser Thr Glu Ile Asn Ser Ala Leu Val Leu Lys
180 185 190

Val Ser Trp Ile Thr Phe Leu Leu Ala Lys Gly Glu Val Leu Gln Met
195 200 205

Glu Asp Asp Leu Val Ile Ser Phe Gln Leu Met Leu Cys Val Leu Asp
210 215 220

Tyr Phe Ile Lys Leu Ser Pro Pro Met Leu Leu Lys Glu Pro Tyr Lys
225 230 235 240

Thr Ala Val Ile Pro Ile Asn Gly Ser Pro Arg Thr Pro Arg Arg Gly
245 250 255

Gln Asn Arg Ser Ala Arg Ile Ala Lys Gln Leu Glu Asn Asp Thr Arg
260 265 270

Ile Ile Glu Val Leu Cys Lys Glu His Glu Cys Asn Ile Asp Glu Val
275 280 285

Lys Asn Val Tyr Phe Lys Asn Phe Ile Pro Phe Met Asn Ser Leu Gly
290 295 300

Leu Val Thr Ser Asn Gly Leu Pro Glu Val Glu Asn Leu Ser Lys Arg
305 310 315 320

Tyr Glu Glu Ile Tyr Leu Lys Asn Lys Asp Leu Asp Ala Arg Leu Phe
325 330 335

Leu Asp His Asp Lys Thr Leu Gln Thr Asp Ser Ile Asp Ser Phe Glu
340 345 350

Thr Gln Arg Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu Val Asn Val
355 360 365

Ile Pro Pro His Thr Pro Val Arg Thr Val Met Asn Thr Ile Gln Gln
370 375 380

Leu Met Met Ile Leu Asn Ser Ala Ser Asp Gln Pro Ser Glu Asn Leu
385 390 395 400

Ile Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro Lys Glu Ser Ile Leu
405 410 415

Lys Arg Val Lys Asp Ile Gly Tyr Ile Phe Lys Glu Lys Phe Ala Lys
420 425 430

Ala Val Gly Gln Gly Cys Val Glu Ile Gly Ser Gln Arg Tyr Lys Leu
435 440 445

Gly Val Arg Leu Tyr Tyr Arg Val Met Glu Ser Met Leu Lys Ser Glu
450 455 460

Glu Glu Arg Leu Ser Ile Gln Asn Phe Ser Lys Leu Leu Asn Asp Asn
465 470 475 480

Ile Phe His Met Ser Leu Leu Ala Cys Ala Leu Glu Val Val Met Ala
485 490 495

Thr Tyr Ser Arg Ser Thr Ser Gln Asn Leu Asp Ser Gly Thr Asp Leu
500 505 510

Ser Phe Pro Trp Ile Leu Asn Val Leu Asn Leu Lys Ala Phe Asp Phe
515 520 525

Tyr Lys Val Ile Glu Ser Phe Ile Lys Ala Glu Gly Asn Leu Thr Arg
530 535 540

Glu Met Ile Lys His Leu Glu Arg Cys Glu His Arg Ile Met Glu Ser
545 550 555 560

Leu Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp Leu Ile Lys Gln Ser
565 570 575

Lys Asp Arg Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys Pro Leu
580 585 590

Asn Leu Pro Leu Gln Asn Asn His Thr Ala Ala Asp Met Tyr Leu Ser
595 600 605

Pro Val Arg Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg Val Asn Ser
610 615 620

Thr Ala Asn Ala Glu Thr Gln Ala Thr Ser Ala Phe Gln Thr Gln Lys
625 630 635 640

Pro Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val Tyr Arg
645 650 655

Leu Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu Ser Glu
660 665 670

His Pro Glu Leu Glu His Ile Ile Trp Thr Leu Phe Gln His Thr Leu
675 680 685

Gln Asn Glu Tyr Glu Leu Met Arg Asp Arg His Leu Asp Gln Ile Met
690 695 700

Met Cys Ser Met Tyr Gly Ile Cys Lys Val Lys Asn Ile Asp Leu Lys
705 710 715 720

Phe Lys Ile Ile Val Thr Ala Tyr Lys Asp Leu Pro His Ala Val Gln
725 730 735

Glu Thr Phe Lys Arg Val Leu Ile Lys Glu Glu Glu Tyr Asp Ser Ile
740 745 750

Ile Val Phe Tyr Asn Ser Val Phe Met Gln Arg Leu Lys Thr Asn Ile
755 760 765

Leu Gln Tyr Ala Ser Thr Arg Pro Pro Thr Leu Ser Pro Ile Pro His
770 775 780

Ile Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser Pro Leu Arg Ile Pro
785 790 795 800

Gly Gly Asn Ile Tyr Ile Ser Pro Leu Lys Ser Pro Tyr Lys Ile Ser
805 810 815

Glu Gly Leu Pro Thr Pro Thr Lys Met Thr Pro Arg Ser Arg Ile Leu
820 825 830

Val Ser Ile Gly Glu Ser Phe Gly Thr Ser Glu Lys Phe Gln Lys Ile
835 840 845

Asn Gln Met Val Cys Asn Ser Asp Arg Val Leu Lys Arg Ser Ala Glu
850 855 860

Gly Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu Arg Phe Asp Ile Glu
865 870 875 880

Gly Ser Asp Glu Ala Asp Gly Ser Lys His Leu Pro Gly Glu Ser Lys
885 890 895

Phe Gln Gln Lys Leu Ala Glu Met Thr Ser Thr Arg Thr Arg Met Gln
900 905 910

Lys Gln Lys Met Asn Asp Ser Met Asp Thr Ser Asn Lys Glu Glu Lys
915 920 925

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3853 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 209..250

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 254..289

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 293..505

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 509..514

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 518..520

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 524..658

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 662..691

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 695..748

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 752..781

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 785..829

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1132..1134

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1138..1149

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 833..862



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

GACGGATCGG GAGATCTCCC GATCCCCTAT GGTCGACTCT CAGTACAATC TGCTCTGATG 60

CCGCATAGTT AAGCCAGTAT CTGCTCCCTG CTTGTGTGTT GGAGGTCGCT GAGTAGTGCG 120

CGAGCAAAAT TTAAGCTACA ACAAGGCAAG GCTTGACCGA CAATTGCATG AAGAATCTGC 180

TTAGGGTTAG GCGTTTTGCG CTGCTTCG CGA TGT ACG GGC CAG ATA TAC GCG 232
Arg Cys Thr Gly Gln Ile Tyr Ala
1 5

TTG ACA TTG ATT ATT GAC TAG TTA TTA ATA GTA ATC AAT TAC GGG GTC 280
Leu Thr Leu Ile Ile Asp Leu Leu Ile Val Ile Asn Tyr Gly Val

10 15



ATT AGT TCA TAG CCC ATA TAT GGA GTT CCG CGT TAC ATA ACT TAC GGT
Ile Ser Ser Pro Ile Tyr Gly Val Pro Arg Tyr Ile Thr Tyr Gly

10 1 5 10

AAA TGG CCC GCC TGG CTG ACC GCC CAA CGA CCC CCG CCC ATT GAC GTC
Lys Trp Pro Ala Trp Leu Thr Ala Gln Arg Pro Pro Pro Ile Asp Val
15 20 25

AAT AAT GAC GTA TGT TCC CAT AGT AAC GCC AAT AGG GAC TTT CCA TTG
Asn Asn Asp Val Cys Ser His Ser Asn Ala Asn Arg Asp Phe Pro Leu
30 35 40

ACG TCA ATG GGT GGA CTA TTT ACG GTA AAC TGC CCA CTT GGC AGT ACA
Thr Ser Met Gly Gly Leu Phe Thr Val Asn Cys Pro Leu Gly Ser Thr
45 50 55 60

TCA AGT GTA TCA TAT GCC AAG TAC GCC CCC TAT TGA CGT CAA
Ser Ser Val Ser Tyr Ala Lys Tyr Ala Pro Tyr Arg Gln

65 70 1

TGA CGG TAA ATG GCC CGC CTG GCA TTA TGC CCA GTA CAT GAC CTT ATG
Arg Met Ala Arg Leu Ala Leu Cys Pro Val His Asp Leu Met

1 1 5 10

GGA CTT TCC TAC TTG GCA GTA CAT CTA CGT ATT AGT CAT CGC TAT TAC
Gly Leu Ser Tyr Leu Ala Val His Leu Arg Ile Ser His Arg Tyr Tyr
15 20 25

CAT GGT GAT GCG GTT TTG GCA GTA CAT CAA TGG GCG TGG ATA GCG GTT
His Gly Asp Ala Val Leu Ala Val His Gln Trp Ala Trp Ile Ala Val
30 35 40 45

TGA CTC ACG GGG ATT TCC AAG TCT CCA CCC CAT TGA CGT CAA TGG GAG
Leu Thr Gly Ile Ser Lys Ser Pro Pro His Arg Gln Trp Glu

1 5 10 1

TTT GTT TTG GCA CCA AAA TCA ACG GGA CTT TCC AAA ATG TCG
Phe Val Leu Ala Pro Lys Ser Thr Gly Leu Ser Lys Met Ser
5 10 15

TAA CAA CTC CGC CCC ATT GAC GCA AAT GGG CGG TAG CGC TGT ACG GTG
Gln Leu Arg Pro Ile Asp Ala Asn Gly Arg Arg Cys Thr Val

1 5 10 1

GGA GGT CTA TAT AAG CAG AGC TCT CTG GCT AAC TAG AGA ACC CAC TGC
Gly Gly Leu Tyr Lys Gln Ser Ser Leu Ala Asn Arg Thr His Cys

5 10 15 1

TTA CTG GCT TAT CGA AAT TAATACGACT CACTATAGGG AGACCCAAGC
Leu Leu Ala Tyr Arg Asn
5 10

TTCGCGCGGG TACCACTCTC TTCCGCATCG CTGTCTGCGA GGGCCAGCTG TTGGGCTCGC 

GGTTGAGGAC AAACTCTTCG CGGTCTTTCC AGTACTCTTG GATCGGAAAC CCGTCGGCCT 

CCGAACGGTA CTCCGCCACC GAGGGACCTG AGCGAGTCCG CATCGACCGG ATCGGAAAAC 



CTCTCGAGGC GGCCGCTGCA GTCTAGACGA ATTCGCGTAC GATATCGATG GGCCCTATT 



CTA TAG TGT CAC CTA AAT GCTAGAGCTC GCTGATCAGC CTCGACTGTG 1179

Leu Cys His Leu Asn

1 1



CCTTCTAGTT GCCAGCCATC TGTTGTTTGC CCCTCCCCCG TGCCTTCCTT GACCCTGGAA 1239

GGTGCCACTC CCACTGTCCT TTCCTAATAA AATGAGGAAA TTGCATCGCA TTGTCTGAGT 1299

AGGTGTCATT CTATTCTGGG GGGTGGGGTG GGGCAGGACA GCAAGGGGGA GGATTGGGAA 1359

GACAATAGCC GAAATGACCG ACCAAGCGAC GCCCAACCTG CCATCACGAG ATTTCGATTC 1419

CACCGCCGCC TTCTATGAAA GGTTGGGCTT CGGAATCGTT TTCCGGGACG CCGGCTGGAT 1479

GATCCTCCAG CGCGGGGATC TCATGCTGGA GTTCTTCGCC CACCCCAACT TGTTTATTGC 1539

AGCTTATAAT GGTTACAAAT AAAGCAATAG CATCACAAAT TTCACAAATA AAGCATTTTT 1599

TTCACTGCAT TCTAGTTGTG GTTTGTCCAA ACTCATCAAT GTATCTTATC ATGTCTGTAT 1659

ACCGTCGACC TCTAGCTAGA GCTTGGCGTA ATCATGGTCA TAGCTGTTTC CTGTGTGAAA 1719

TTGTTATCCG CTCACAATTC CACACAACAT ACGAGCCGGA AGCATAAAGT GTAAAGCCTG 1779

GGGTGCCTAA TGAGTGAGCT AACTCACATT AATTGCGTTG CGCTCACTGC CCGCTTTCCA 1839

GTCGGGAAAC CTGTCGTGCC AGCTGCATTA ATGAATCGGC CAACGCGCGG GGAGAGGCGG 1899

TTTGCGTATT GGGCGCTCTT CCGCTTCCTC GCTCACTGAC TCGCTGCGCT CGGTCGTTCG 1959

GCTGCGGCGA GCGGTATCAG CTCACTCAAA GGCGGTAATA CGGTTATCCA CAGAATCAGG 2019

GGATAACGCA GGAAAGAACA TGTGAGCAAA AGGCCAGCAA AAGGCCAGGA ACCGTAAAAA 2079

GGCCGCGTTG CTGGCGTTTT TCCATAGGCT CCGCCCCCCT GACGAGCATC ACAAAAATCG 2139

ACGCTCAAGT CAGAGGTGGC GAAACCCGAC AGGACTATAA AGATACCAGG CGTTTCCCCC 2199

TGGAAGCTCC CTCGTGCGCT CTCCTGTTCC GACCCTGCCG CTTACCGGAT ACCTGTCCGC 2259

CTTTCTCCCT TCGGGAAGCG TGGCGCTTTC TCAATGCTCA CGCTGTAGGT ATCTCAGTTC 2319

GGTGTAGGTC GTTCGCTCCA AGCTGGGCTG TGTGCACGAA CCCCCCGTTC AGCCCGACCG 2379

CTGCGCCTTA TCCGGTAACT ATCGTCTTGA GTCCAACCCG GTAAGACACG ACTTATCGCC 2439

ACTGGCAGCA GCCACTGGTA ACAGGATTAG CAGAGCGAGG TATGTAGGCG GTGCTACAGA 2499

GTTCTTGAAG TGGTGGCCTA ACTACGGCTA CACTAGAAGG ACAGTATTTG GTATCTGCGC 2559

TCTGCTGAAG CCAGTTACCT TCGGAAAAAG AGTTGGTAGC TCTTGATCCG GCAAACAAAC 2619

CACCGCTGGT AGCGGTGGTT TTTTTGTTTG CAAGCAGCAG ATTACGCGCA GAAAAAAAGG 2679

ATCTCAAGAA GATCCTTTGA TCTTTTCTAC GGGGTCTGAC GCTCAGTGGA ACGAAAACTC 2739

ACGTTAAGGG ATTTTGGTCA TGAGATTATC AAAAAGGATC TTCACCTAGA TCCTTTTAAA 2799

TTAAAAATGA AGTTTTAAAT CAATCTAAAG TATATATGAG TAAACTTGGT CTGACAGTTA 2859

CCAATGCTTA ATCAGTGAGG CACCTATCTC AGCGATCTGT CTATTTCGTT CATCCATAGT 2919

TGCCTGACTC CCCGTCGTGT AGATAACTAC GATACGGGAG GGCTTACCAT CTGGCCCCAG 2979

TGCTGCAATG ATACCGCGAG ACCCACGCTC ACCGGCTCCA GATTTATCAG CAATAAACCA 3039

GCCAGCCGGA AGGGCCGAGC GCAGAAGTGG TCCTGCAACT TTATCCGCCT CCATCCAGTC 3099

TATTAATTGT TGCCGGGAAG CTAGAGTAAG TAGTTCGCCA GTTAATAGTT TGCGCAACGT 3159

TGTTGCCATT GCTACAGGCA TCGTGGTGTC ACGCTCGTCG TTTGGTATGG CTTCATTCAG 3219

CTCCGGTTCC CAACGATCAA GGCGAGTTAC ATGATCCCCC ATGTTGTGCA AAAAAGCGGT 3279

TAGCTCCTTC GGTCCTCCGA TCGTTGTCAG AAGTAAGTTG GCCGCAGTGT TATCACTCAT 3339

GGTTATGGCA GCACTGCATA ATTCTCTTAC TGTCATGCCA TCCGTAAGAT GCTTTTCTGT 3399

GACTGGTGAG TACTCAACCA AGTCATTCTG AGAATAGTGT ATGCGGCGAC CGAGTTGCTC 3459

TTGCCCGGCG TCAATACGGG ATAATACCGC GCCACATAGC AGAACTTTAA AAGTGCTCAT 3519

CATTGGAAAA CGTTCTTCGG GGCGAAAACT CTCAAGGATC TTACCGCTGT TGAGATCCAG 3579

TTCGATGTAA CCCACTCGTG CACCCAACTG ATCTTCAGCA TCTTTTACTT TCACCAGCGT 3639

TTCTGGGTGA GCAAAAACAG GAAGGCAAAA TGCCGCAAAA AAGGGAATAA GGGCGACACG 3699

GAAATGTTGA ATACTCATAC TCTTCCTTTT TCAATATTAT TGAAGCATTT ATCAGGGTTA 3759

TTGTCTCATG AGCGGATACA TATTTGAATG TATTTAGAAA AATAAACAAA TAGGGGTTCC 3819

GCGCACATTT CCCCGAAAAG TGCCACCTGA CGTC 3853

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 14 amino acids
(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Arg Cys Thr Gly Gln Ile Tyr Ala Leu Thr Leu Ile Ile Asp
1 5 10

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 12 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Leu Leu Ile Val Ile Asn Tyr Gly Val Ile Ser Ser
1 5 10

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 71 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Pro Ile Tyr Gly Val Pro Arg Tyr Ile Thr Tyr Gly Lys Trp Pro Ala
1 5 10 15

Trp Leu Thr Ala Gln Arg Pro Pro Pro Ile Asp Val Asn Asn Asp Val
20 25 30

Cys Ser His Ser Asn Ala Asn Arg Asp Phe Pro Leu Thr Ser Met Gly
35 40 45

Gly Leu Phe Thr Val Asn Cys Pro Leu Gly Ser Thr Ser Ser Val Ser
50 55 60

Tyr Ala Lys Tyr Ala Pro Tyr



(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Arg Gln

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Arg 
1 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Met Ala Arg Leu Ala Leu Cys Pro Val His Asp Leu Met Gly Leu Ser
1 5 10 15

Tyr Leu Ala Val His Leu Arg Ile Ser His Arg Tyr Tyr His Gly Asp
20 25 30

Ala Val Leu Ala Val His Gln Trp Ala Trp Ile Ala Val
35 40 45

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Leu Thr Gly Ile Ser Lys Ser Pro Pro His
1 5 10



(2) INFORMATION FOR SEQ ID NO: 13: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Arg Gln Trp Glu Phe Val Leu Ala Pro Lys Ser Thr Gly Leu Ser Lys
1 5 10 15

Met Ser



(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Gln Leu Arg Pro Ile Asp Ala Asn Gly Arg
1 5 10

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Arg Cys Thr Val Gly Gly Leu Tyr Lys Gln Ser Ser Leu Ala Asn
1 5 10 15



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Arg Thr His Cys Leu Leu Ala Tyr Arg Asn
1 5 10



(2) INFORMATION FOR SEQ ID NO: 17: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:



Leu 
1 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

Cys His Leu Asn 
1 

(2) INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4025 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 209..250

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 254..289

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 293..505

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 509..514

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 518..520

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 524..658

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 662..691

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 695..748

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 752..781

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 785..829

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 833..862

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1305..1307

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1311..1322



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

GACGGATCGG GAGATCTCCC GATCCCCTAT GGTCGACTCT CAGTACAATC TGCTCTGATG 60

CCGCATAGTT AAGCCAGTAT CTGCTCCCTG CTTGTGTGTT GGAGGTCGCT GAGTAGTGCG 120

CGAGCAAAAT TTAAGCTACA ACAAGGCAAG GCTTGACCGA CAATTGCATG AAGAATCTGC 180

TTAGGGTTAG GCGTTTTGCG CTGCTTCG CGA TGT ACG GGC CAG ATA TAC GCG 232

Arg Cys Thr Gly Gln Ile Tyr Ala
1 5

TTG ACA TTG ATT ATT GAC TAG TTA TTA ATA GTA ATC AAT TAC GGG GTC 280
Leu Thr Leu Ile Ile Asp Leu Leu Ile Val Ile Asn Tyr Gly Val

10 15

ATT AGT TCA TAG CCC ATA TAT GGA GTT CCG CGT TAC ATA ACT TAC GGT 328

Ile Ser Ser Pro Ile Tyr Gly Val Pro Arg Tyr Ile Thr Tyr Gly

10 1 5 10

AAA TGG CCC GCC TGG CTG ACC GCC CAA CGA CCC CCG CCC ATT GAC GTC 376

Lys Trp Pro Ala Trp Leu Thr Ala Gln Arg Pro Pro Pro Ile Asp Val
15 20 25

AAT AAT GAC GTA TGT TCC CAT AGT AAC GCC AAT AGG GAC TTT CCA TTG 424

Asn Asn Asp Val Cys Ser His Ser Asn Ala Asn Arg Asp Phe Pro Leu

30 35 40

ACG TCA ATG GGT GGA CTA TTT ACG GTA AAC TGC CCA CTT GGC AGT ACA 472

Thr Ser Met Gly Gly Leu Phe Thr Val Asn Cys Pro Leu Gly Ser Thr
45 50 55 60

TCA AGT GTA TCA TAT GCC AAG TAC GCC CCC TAT TGA CGT CAA 514
Ser Ser Val Ser Tyr Ala Lys Tyr Ala Pro Tyr Arg Gln

65 70 1

TGA CGG TAA ATG GCC CGC CTG GCA TTA TGC CCA GTA CAT GAC CTT ATG 562
Arg Met Ala Arg Leu Ala Leu Cys Pro Val His Asp Leu Met

1 1 5 10

GGA CTT TCC TAC TTG GCA GTA CAT CTA CGT ATT AGT CAT CGC TAT TAC 610
Gly Leu Ser Tyr Leu Ala Val His Leu Arg Ile Ser His Arg Tyr Tyr
15 20 25

CAT GGT GAT GCG GTT TTG GCA GTA CAT CAA TGG GCG TGG ATA GCG GTT 658

His Gly Asp Ala Val Leu Ala Val His Gln Trp Ala Trp Ile Ala Val
30 35 40 45

TGA CTG ACG GGG ATT TCC AAG TCT CCA CCC CAT TGA CGT CAA TGG GAG 706
Leu Thr Gly Ile Ser Lys Ser Pro Pro His Arg Gln Trp Glu

1 5 10 1

TTT GTT TTG GCA CCA AAA TCA ACG GGA CTT TCC AAA ATG TCG 748

Phe Val Leu Ala Pro Lys Ser Thr Gly Leu Ser Lys Met Ser
5 10 15

TAA CAA CTC CGC CCC ATT GAC GCA AAT GGG CGG TAG GCG TGT ACG GTG 796
Gln Leu Arg Pro Ile Asp Ala Asn Gly Arg Ala Cys Thr Val

1 5 10 1

GGA GGT CTA TAT AAG CAG AGC TCT CTG GCT AAC TAG AGA ACC CAC TGC 844
Gly Gly Leu Tyr Lys Gln Ser Ser Leu Ala Asn Arg Thr His Cys

5 10 15 1

TTA CTG GCT TAT CGA AAT TAATACGACT CACTATAGGG AGACCCAAGC 892
Leu Leu Ala Tyr Arg Asn
5 10

TTCGCGCGGG TACCACTCTC TTCCGCATCG CTGTCTGCGA GGGCCAGCTG TTGGGCTCGC 952 

GGTTGAGGAC AAACTCTTCG CGGTCTTTCC AGTACTCTTG GATCGGAAAC CCGTCGGCCT 1012 

CCGAACGGTA CTCCGCCACC GAGGGACCTG AGCGAGTCCG CATCGACCGG ATCGGAAAAC 1072 

CTCTCGAGGA ACTGAAAAAC CAGAAAGTTA ACTGGTAAGT TTAGTCTTTT TGTCTTTTTA 1132

TTTCAGGTCC CGGATCCGGT GGTGGTGCAA ATCAAAGAAC TGCTCCTCAG TGGATGTTGC 1192

CTTTACTTCT AGGCCTGTAC GGAAGTGTTA CTTCTGCTCT AAAAGCTGCG GAATTGTACC 1252

CGCGGCCGCT GCAGTCTAGA CGAATTCGCG TACGATATCG ATGGGCCCTA TT CTA 1307
Leu
1

TAG TGT CAC CTA AAT GCTAGAGCTC GCTGATCAGC CTCGACTGTG CCTTCTAGTT 1362
Cys His Leu Asn
1

GCCAGCCATC TGTTGTTTGC CCCTCCCCCG TGCCTTCCTT GACCCTGGAA GGTGCCACTC 1422

CCACTGTCCT TTCCTAATAA AATGAGGAAA TTGCATCGCA TTGTCTGAGT AGGTGTCATT 1482

CTATTCTGGG GGGTGGGGTG GGGCAGGACA GCAAGGGGGA GGATTGGGAA GACAATAGCC 1542

GAAATGACCG ACCAAGCGAC GCCCAACCTG CCATCACGAG ATTTCGATTC CACCGCCGCC 1602

TTCTATGAAA GGTTGGGCTT CGGAATCGTT TTCCGGGACG CCGGCTGGAT GATCCTCCAG 1662

CGCGGGGATC TCATGCTGGA GTTCTTCGCC CACCCCAACT TGTTTATTGC AGCTTATAAT 1722

GGTTACAAAT AAAGCAATAG CATCACAAAT TTCACAAATA AAGCATTTTT TTCACTGCAT 1782

TCTAGTTGTG GTTTGTCCAA ACTCATCAAT GTATCTTATC ATGTCTGTAT ACCGTCGACC 1842

TCTAGCTAGA GCTTGGCGTA ATCATGGTCA TAGCTGTTTC CTGTGTGAAA TTGTTATCCG 1902

CTCACAATTC CACACAACAT ACGAGCCGGA AGCATAAAGT GTAAAGCCTG GGGTGCCTAA 1962

TGAGTGAGCT AACTCACATT AATTGCGTTG CGCTCACTGC CCGCTTTCCA GTCGGGAAAC 2022

CTGTCGTGCC AGCTGCATTA ATGAATCGGC CAACGCGCGG GGAGAGGCGG TTTGCGTATT 2082

GGGCGCTCTT CCGCTTCCTC GCTCACTGAC TCGCTGCGCT CGGTCGTTCG GCTGCGGCGA 2142

GCGGTATCAG CTCACTCAAA GGCGGTAATA CGGTTATCCA CAGAATCAGG GGATAACGCA 2202

GGAAAGAACA TGTGAGCAAA AGGCCAGCAA AAGGCCAGGA ACCGTAAAAA GGCCGCGTTG 2262

CTGGCGTTTT TCCATAGGCT CCGCCCCCCT GACGAGCATC ACAAAAATCG ACGCTCAAGT 2322

CAGAGGTGGC GAAACCCGAC AGGACTATAA AGATACCAGG CGTTTCCCCC TGGAAGCTCC 2382

CTCGTGCGCT CTCCTGTTCC GACCCTGCCG CTTACCGGAT ACCTGTCCGC CTTTCTCCCT 2442

TCGGGAAGCG TGGCGCTTTC TCAATGCTCA CGCTGTAGGT ATCTCAGTTC GGTGTAGGTC 2502

GTTCGCTCCA AGCTGGGCTG TGTGCACGAA CCCCCCGTTC AGCCCGACCG CTGCGCCTTA 2562

TCCGGTAACT ATCGTCTTGA GTCCAACCCG GTAAGACACG ACTTATCGCC ACTGGCAGCA 2622

GCCACTGGTA ACAGGATTAG CAGAGCGAGG TATGTAGGCG GTGCTACAGA GTTCTTGAAG 2682

TGGTGGCCTA ACTACGGCTA CACTAGAAGG ACAGTATTTG GTATCTGCGC TCTGCTGAAG 2742

CCAGTTACCT TCGGAAAAAG AGTTGGTAGC TCTTGATCCG GCAAACAAAC CACCGCTGGT 2802

AGCGGTGGTT TTTTTGTTTG CAAGCAGCAG ATTACGCGCA GAAAAAAAGG ATCTCAAGAA 2862

GATCCTTTGA TCTTTTCTAC GGGGTCTGAC GCTCAGTGGA ACGAAAACTC ACGTTAAGGG 2922

ATTTTGGTCA TGAGATTATC AAAAAGGATC TTCACCTAGA TCCTTTTAAA TTAAAAATGA 2982

AGTTTTAAAT CAATCTAAAG TATATATGAG TAAACTTGGT CTGACAGTTA CCAATGCTTA 3042

ATCAGTGAGG CACCTATCTC AGCGATCTGT CTATTTCGTT CATCCATAGT TGCCTGACTC 3102 

CCCGTCGTGT AGATAACTAC GATACGGGAG GGCTTACCAT CTGGCCCCAG TGCTGCAATG 3162 

ATACCGCGAG ACCCACGCTC ACCGGCTCCA GATTTATCAG CAATAAACCA GCCAGCCGGA 3222

AGGGCCGAGC GCAGAAGTGG TCCTGCAACT TTATCCGCCT CCATCCAGTC TATTAATTGT 3282 
TGCCGGGAAG CTAGAGTAAG TAGTTCGCCA GTTAATAGTT TGCGCAACGT TGTTGCCATT 3342

GCTACAGGCA TCGTGGTGTC ACGCTCGTCG TTTGGTATGG CTTCATTCAG CTCCGGTTCC 3402

CAACGATCAA GGCGAGTTAC ATGATCCCCC ATGTTGTGCA AAAAAGCGGT TAGCTCCTTC 3462

GGTCCTCCGA TCGTTGTCAG AAGTAAGTTG GCCGCAGTGT TATCACTCAT GGTTATGGCA 3522

GCACTGCATA ATTCTCTTAC TGTCATGCCA TCCGTAAGAT GCTTTTCTGT GACTGGTGAG 3582

TACTCAACCA AGTCATTCTG AGAATAGTGT ATGCGGCGAC CGAGTTGCTC TTGCCCGGCG 3642

TCAATACGGG ATAATACCGC GCCACATAGC AGAACTTTAA AAGTGCTCAT CATTGGAAAA 3702

CGTTCTTCGG GGCGAAAACT CTCAAGGATC TTACCGCTGT TGAGATCCAG TTCGATGTAA 3762

CCCACTCGTG CACCCAACTG ATCTTCAGCA TCTTTTACTT TCACCAGCGT TTCTGGGTGA 3822

GCAAAAACAG GAAGGCAAAA TGCCGCAAAA AAGGGAATAA GGGCGACACG GAAATGTTGA 3882

ATACTCATAC TCTTCCTTTT TCAATATTAT TGAAGCATTT ATCAGGGTTA TTGTCTCATG 3942

AGCGGATACA TATTTGAATG TATTTAGAAA AATAAACAAA TAGGGGTTCC GCGCACATTT 4002

CCCCGAAAAG TGCCACCTGA CGTC 4026

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

Arg Cys Thr Gly Gln Ile Tyr Ala Leu Thr Leu Ile Ile Asp
1 5 10

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

Leu Leu Ile Val Ile Asn Tyr Gly Val Ile Ser Ser
1 5 10

(2) INFORMATION FOR SEQ ID NO: 22: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

Pro Ile Tyr Gly Val Pro Arg Tyr Ile Thr Tyr Gly Lys Trp Pro Ala
1 5 10 15

Trp Leu Thr Ala Gln Arg Pro Pro Pro Ile Asp Val Asn Asn Asp Val
20 25 30

Cys Ser His Ser Asn Ala Asn Arg Asp Phe Pro Leu Thr Ser Met Gly 
35 40 45 

Gly Leu Phe Thr Val Asn Cys Pro Leu Gly Ser Thr Ser Ser Val Ser 
50 55 60 

Tyr Ala Lys Tyr Ala Pro Tyr 
65 70 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Arg Gln
1

(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

Arg 
1 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

Met Ala Arg Leu Ala Leu Cys Pro Val His Asp Leu Met Gly Leu Ser
1 5 10 15

Tyr Leu Ala Val His Leu Arg Ile Ser His Arg Tyr Tyr His Gly Asp
20 25 30

Ala Val Leu Ala Val His Gln Trp Ala Trp Ile Ala Val
35 40 45

(2) INFORMATION FOR SEQ ID NO: 26:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

Leu Thr Gly Ile Ser Lys Ser Pro Pro His
1 5 10



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

Arg Gln Trp Glu Phe Val Leu Ala Pro Lys Ser Thr Gly Leu Ser Lys
1 5 10 15



Met Ser 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Gln Leu Arg Pro Ile Asp Ala Asn Gly Arg
1 5 10

(2) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 15 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

Ala Cys Thr Val Gly Gly Leu Tyr Lys Gln Ser Ser Leu Ala Asn
1 5 10 15 



(2) INFORMATION FOR SEQ ID NO: 30:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Arg Thr His Cys Leu Leu Ala Tyr Arg Asn 
1 5 10

(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

Leu
1



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:

Cys His Leu Asn 

1 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4249 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)



(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 209..250

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 254..289

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 293..505

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 509..514

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 518..520

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 524..658

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 662..691

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 695..748

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 752..781

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 785..829

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 833..862

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1528..1530

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1534..1545



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

GACGGATCGG GAGATCTCCC GATCCCCTAT GGTCGACTCT CAGTACAATC TGCTCTGATG 60 

CCGCATAGTT AAGCCAGTAT CTGCTCCCTG CTTGTGTGTT GGAGGTCGCT GAGTAGTGCG 120

CGAGCAAAAT TTAAGCTACA ACAAGGCAAG GCTTGACCGA CAATTGCATG AAGAATCTGC 180

TTAGGGTTAG GCGTTTTGCG CTGCTTCG CGA TGT ACG GGC CAG ATA TAC GCG 232
Arg Cys Thr Gly Gln Ile Tyr Ala
1 5

TTG ACA TTG ATT ATT GAC TAG TTA TTA ATA GTA ATC AAT TAC GGG GTC 280
Leu Thr Leu Ile Ile Asp Leu Leu Ile Val Ile Asn Tyr Gly Val
10 1 5

ATT AGT TCA TAG CCC ATA TAT GGA GTT CCG CGT TAC ATA ACT TAC GGT 328
Ile Ser Ser Pro Ile Tyr Gly Val Pro Arg Tyr Ile Thr Tyr Gly
10 1 5 10

AAA TGG CCC GCC TGG CTG ACC GCC CAA CGA CCC CCG CCC ATT GAC GTC 376
Lys Trp Pro Ala Trp Leu Thr Ala Gln Arg Pro Pro Pro Ile Asp Val
15 20 25

AAT AAT GAC GTA TGT TCC CAT AGT AAC GCC AAT AGG GAC TTT CCA TTG 424
Asn Asn Asp Val Cys Ser His Ser Asn Ala Asn Arg Asp Phe Pro Leu
30 35 40

ACG TCA ATG GGT GGA CTA TTT ACG GTA AAC TGC CCA CTT GGC AGT ACA 472
Thr Ser Met Gly Gly Leu Phe Thr Val Asn Cys Pro Leu Gly Ser Thr
45 50 55 60

TCA AGT GTA TCA TAT GCC AAG TAC GCC CCC TAT TGA CGT CAA 514
Ser Ser Val Ser Tyr Ala Lys Tyr Ala Pro Tyr Arg Gln
65 70 1

TGA CGG TAA ATG GCC CGC CTG GCA TTA TGC CCA GTA CAT GAC CTT ATG 562
Arg Met Ala Arg Leu Ala Leu Cys Pro Val His Asp Leu Met
1 1 5 10

GGA CTT TCC TAC TTG GCA GTA CAT CTA CGT ATT AGT CAT CGC TAT TAC 610
Gly Leu Ser Tyr Leu Ala Val His Leu Arg Ile Ser His Arg Tyr Tyr
15 20 25



CAT GGT GAT GCG GTT TTG GCA GTA CAT CAA TGG GCG TGG ATA GCG GTT 658
His Gly Asp Ala Val Leu Ala Val His Gln Trp Ala Trp Ile Ala Val
30 35 40 45

TGA CTC ACG GGG ATT TCC AAG TCT CCA CCC CAT TGA CGT CAA TGG GAG 706
Leu Thr Gly Ile Ser Lys Ser Pro Pro His Arg Gln Trp Glu
1 5 10 1

TTT GTT TTG GCA CCA AAA TCA ACG GGA CTT TCC AAA ATG TCG 748
Phe Val Leu Ala Pro Lys Ser Thr Gly Leu Ser Lys Met Ser
5 10 15

TAA CAA CTC CGC CCC ATT GAC GCA AAT GGG CGG TAG GCG TGT ACG GTG 796
Gln Leu Arg Pro Ile Asp Ala Asn Gly Arg Ala Cys Thr Val
1 5 10 1

GGA GGT CTA TAT AAG CAG AGC TCT CTG GCT AAC TAG AGA ACC CAC TGC 844
Gly Gly Leu Tyr Lys Gln Ser Ser Leu Ala Asn Arg Thr His Cys
5 10 15 1



TTA CTG GCT TAT CGA AAT TAATACGACT CACTATAGGG AGACCCAAGC 892
Leu Leu Ala Tyr Arg Asn
5 10

TTCGCGCGGG TACCACTCTC TTCCGCATCG CTGTCTGCGA GGGCCAGCTG TTGGGCTCGC 952

GGTTGAGGAC AAACTCTTCG CGGTCTTTCC AGTACTCTTG GATCGGAAAC CCGTCGGCCT 1012

CCGAACGGTA CTCCGCCACC GAGGGACCTG AGCGAGTCCG CATCGACCGG ATCGGAAAAC 1072

CTCTCGAGGA ACTGAAAAAC CAGAAAGTTA ACTGGTAAGT TTAGTCTTTT TGTCTTTTTA 1132

TTTCAGGTCC CGGATCTGAG TTAGGGCGGG ACATGGGCGG AGTTAGGGGC GGGACTATGG 1192

TTGCTGACTA ATTGAGATGC ATGCTTTGCA TACTTCTGCC TGCTGGGGAG CCTGGGGACT 1252

TTCCACACCT GGTTGCTGAC TAATTGAGAT GCATGCTTTG CATACTTCTG CCTGCTGGGG 1312

AGCCTGGGGA CTTTCCACAC CCTAACTGAC ACACATTCCA CAGCTGGTTC TTTCAGATCC 1372

GGTGGTGGTG CAAATCAAAG AACTGCTCCT CAGTGGATGT TGCCTTTACT TCTAGGCCTG 1432

TACGGAAGTG TTACTTCTGC TCTAAAAGCT GCGGAATTGT ACCCGCGGCC GCTGCAGTCT 1492

AGACGAATTC GCGTACGATA TCGATGGGCC CTATT CTA TAG TGT CAC CTA AAT 1545
Leu Cys His Leu Asn
1 1



GCTAGAGCTC GCTGATCAGC CTCGACTGTG CCTTCTAGTT GCCAGCCATC TGTTGTTTGC 1605

CCCTCCCCCG TGCCTTCCTT GACCCTGGAA GGTGCCACTC CCACTGTCCT TTCCTAATAA 1665

AATGAGGAAA TTGCATCGCA TTGTCTGAGT AGGTGTCATT CTATTCTGGG GGGTGGGGTG 1725

GGGCAGGACA GCAAGGGGGA GGATTGGGAA GACAATAGCC GAAATGACCG ACCAAGCGAC 1785

GCCCAACCTG CCATCACGAG ATTTCGATTC CACCGCCGCC TTCTATGAAA GGTTGGGCTT 1845

CGGAATCGTT TTCCGGGACG CCGGCTGGAT GATCCTCCAG CGCGGGGATC TCATGCTGGA 1905

GTTCTTCGCC CACCCCAACT TGTTTATTGC AGCTTATAAT GGTTACAAAT AAAGCAATAG 1965

CATCACAAAT TTCACAAATA AAGCATTTTT TTCACTGCAT TCTAGTTGTG GTTTGTCCAA 2025

ACTCATCAAT GTATCTTATC ATGTCTGTAT ACCGTCGACC TCTAGCTAGA GCTTGGCGTA 2085

ATCATGGTCA TAGCTGTTTC CTGTGTGAAA TTGTTATCCG CTCACAATTC CACACAACAT 2145

ACGAGCCGGA AGCATAAAGT GTAAAGCCTG GGGTGCCTAA TGAGTGAGCT AACTCACATT 2205

AATTGCGTTG CGCTCACTGC CCGCTTTCCA GTCGGGAAAC CTGTCGTGCC AGCTGCATTA 2265

ATGAATCGGC CAACGCGCGG GGAGAGGCGG TTTGCGTATT GGGCGCTCTT CCGCTTCCTC 2325

GCTCACTGAC TCGCTGCGCT CGGTCGTTCG GCTGCGGCGA GCGGTATCAG CTCACTCAAA 2385

GGCGGTAATA CGGTTATCCA CAGAATCAGG GGATAACGCA GGAAAGAACA TGTGAGCAAA 2445

AGGCCAGCAA AAGGCCAGGA ACCGTAAAAA GGCCGCGTTG CTGGCGTTTT TCCATAGGCT 2505

CCGCCCCCCT GACGAGCATC ACAAAAATCG ACGCTCAAGT CAGAGGTGGC GAAACCCGAC 2565

AGGACTATAA AGATACCAGG CGTTTCCCCC TGGAAGCTCC CTCGTGCGCT CTCCTGTTCC 2625

GACCCTGCCG CTTACCGGAT ACCTGTCCGC CTTTCTCCCT TCGGGAAGCG TGGCGCTTTC 2685

TCAATGCTCA CGCTGTAGGT ATCTCAGTTC GGTGTAGGTC GTTCGCTCCA AGCTGGGCTG 2745

TGTGCACGAA CCCCCCGTTC AGCCCGACCG CTGCGCCTTA TCCGGTAACT ATCGTCTTGA 2805

GTCCAACCCG GTAAGACACG ACTTATCGCC ACTGGCAGCA GCCACTGGTA ACAGGATTAG 2865

CAGAGCGAGG TATGTAGGCG GTGCTACAGA GTTCTTGAAG TGGTGGCCTA ACTACGGCTA 2925

CACTAGAAGG ACAGTATTTG GTATCTGCGC TCTGCTGAAG CCAGTTACCT TCGGAAAAAG 2985

AGTTGGTAGC TCTTGATCCG GCAAACAAAC CACCGCTGGT AGCGGTGGTT TTTTTGTTTG 3045

CAAGCAGCAG ATTACGCGCA GAAAAAAAGG ATCTCAAGAA GATCCTTTGA TCTTTTCTAC 3105

GGGGTCTGAC GCTCAGTGGA ACGAAAACTC ACGTTAAGGG ATTTTGGTCA TGAGATTATC 3165

AAAAAGGATC TTCACCTAGA TCCTTTTAAA TTAAAAATGA AGTTTTAAAT CAATCTAAAG 3225

TATATATGAG TAAACTTGGT CTGACAGTTA CCAATGCTTA ATCAGTGAGG CACCTATCTC 3285

AGCGATCTGT CTATTTCGTT CATCCATAGT TGCCTGACTC CCCGTCGTGT AGATAACTAC 3345

GATACGGGAG GGCTTACCAT CTGGCCCCAG TGCTGCAATG ATACCGCGAG ACCCACGCTC 3405

ACCGGCTCCA GATTTATCAG CAATAAACCA GCCAGCCGGA AGGGCCGAGC GCAGAAGTGG 3465

TCCTGCAACT TTATCCGCCT CCATCCAGTC TATTAATTGT TGCCGGGAAG CTAGAGTAAG 3525

TAGTTCGCCA GTTAATAGTT TGCGCAACGT TGTTGCCATT GCTACAGGCA TCGTGGTGTC 3585

ACGCTCGTCG TTTGGTATGG CTTCATTCAG CTCCGGTTCC CAACGATCAA GGCGAGTTAC 3645

ATGATCCCCC ATGTTGTGCA AAAAAGCGGT TAGCTCCTTC GGTCCTCCGA TCGTTGTCAG 3705

AAGTAAGTTG GCCGCAGTGT TATCACTCAT GGTTATGGCA GCACTGCATA ATTCTCTTAC 3765

TGTCATGCCA TCCGTAAGAT GCTTTTCTGT GACTGGTGAG TACTCAACCA AGTCATTCTG 3825

AGAATAGTGT ATGCGGCGAC CGAGTTGCTC TTGCCCGGCG TCAATACGGG ATAATACCGC 3885

GCCACATAGC AGAACTTTAA AAGTGCTCAT CATTGGAAAA CGTTCTTCGG GGCGAAAACT 3945

CTCAAGGATC TTACCGCTGT TGAGATCCAG TTCGATGTAA CCCACTCGTG CACCCAACTG 4005

ATCTTCAGCA TCTTTTACTT TCACCAGCGT TTCTGGGTGA GCAAAAACAG GAAGGCAAAA 4065

TGCCGCAAAA AAGGGAATAA GGGCGACACG GAAATGTTGA ATACTCATAC TCTTCCTTTT 4125

TCAATATTAT TGAAGCATTT ATCAGGGTTA TTGTCTCATG AGCGGATACA TATTTGAATG 4185

TATTTAGAAA AATAAACAAA TAGGGGTTCC GCGCACATTT CCCCGAAAAG TGCCACCTGA 4245

CGTC 4249

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:

Arg Cys Thr Gly Gln Ile Tyr Ala Leu Thr Leu Ile Ile Asp
1 5 10

(2) INFORMATION FOR SEQ ID NO: 35:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Leu Leu Ile Val Ile Asn Tyr Gly Val Ile Ser Ser
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Pro Ile Tyr Gly Val Pro Arg Tyr Ile Thr Tyr Gly Lys Trp Pro Ala
1 5 10 15

Trp Leu Thr Ala Gln Arg Pro Pro Pro Ile Asp Val Asn Asn Asp Val
20 25 30

Cys Ser His Ser Asn Ala Asn Arg Asp Phe Pro Leu Thr Ser Met Gly 
35 40 45 

Gly Leu Phe Thr Val Asn Cys Pro Leu Gly Ser Thr Ser Ser Val Ser 
50 55 60 

Tyr Ala Lys Tyr Ala Pro Tyr 
65 70 



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:

Arg Gln



(2) INFORMATION FOR SEQ ID NO: 38:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38:

Arg



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:

Met Ala Arg Leu Ala Leu Cys Pro Val His Asp Leu Met Gly Leu Ser
1 5 10 15

Tyr Leu Ala Val His Leu Arg Ile Ser His Arg Tyr Tyr His Gly Asp
20 25 30

Ala Val Leu Ala Val His Gln Trp Ala Trp Ile Ala Val
35 40 45

(2) INFORMATION FOR SEQ ID NO: 40:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Leu Thr Gly Ile Ser Lys Ser Pro Pro His
1 5 10

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41:

Arg Gln Trp Glu Phe Val Leu Ala Pro Lys Ser Thr Gly Leu Ser Lys
1 5 10 15

Met Ser

(2) INFORMATION FOR SEQ ID NO: 42:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 

Gln Leu Arg Pro Ile Asp Ala Asn Gly Arg
1 5 10



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 

Ala Cys Thr Val Gly Gly Leu Tyr Lys Gln Ser Ser Leu Ala Asn
1 5 10 15 



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44:

Arg Thr His Cys Leu Leu Ala Tyr Arg Asn
1 5 10



(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45:

Leu 

1 



(2) INFORMATION FOR SEQ ID NO: 46:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46:

Cys His Leu Asn 
1 



